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Abstract — A cellular multiple-input multiple-output (MIMO) 
downlink system is studied in which each base station (BS) 
transmits to some of the users, so that each user receives its 
intended signal from a subset of the BSs. This scenario is referred 
to as network MIMO with partial cooperation, since only a subset 
of the BSs are able to coordinate their transmission towards any 
user. The focus of this paper is on the optimization of linear 
beamforming strategies at the BSs and at the users for network 
MIMO with partial cooperation. Individual power constraints at 
the BSs are enforced, along with constraints on the number of 
streams per user. It is first shown that the system is equivalent to 
a MIMO interference channel with generalized linear constraints 
(MIMO-IFC-GC). The problems of maximizing the sum-rate 
(SR) and minimizing the weighted sum mean square error 
(WSMSE) of the data estimates are non-convex, and suboptimal 
solutions with reasonable complexity need to be devised. Based on 
this, suboptimal techniques that aim at maximizing the sum-rate 
for the MIMO-IFC-GC are reviewed from recent literature and 
extended to the MIMO-IFC-GC where necessary. Novel designs 
that aim at minimizing the WSMSE are then proposed. Extensive 
numerical simulations are provided to compare the performance 
of the considered schemes for realistic cellular systems. 



I. Introduction 

Interference is known to be major obstacle for realizing 
the spectral efficiency increase promised by multiple-antenna 
techniques in wireless systems. Indeed, the multiple-input 
multiple-output (MIMO) capacity gains are severely degraded 
and limited in cellular environments due to the deleterious 
effect of interference |0Q,ED. Therefore, network-level inter- 
ference management appears to be of fundamental importance 
to overcome this limitation and harness the gains of MIMO 
technology. Confirming this point, multi-cell cooperation, also 
known as network MIMO, has been shown to significantly 
improve the system performance [3 1. 
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Network MIMO involves cooperative transmission by mul- 
tiple base stations (BSs) to each user. Depending on the level 
of multi-cell cooperation, network MIMO reduces to a number 
of scenarios, ranging from a MIMO broadcast channel (BC) 
[ID in case of full cooperation among all BSs, to a MIMO 
interference channel [5 1,[6| in case no cooperation at the BSs 
is allowed. In general, network MIMO allows cooperation only 
among a cluster of BSs for transmission to a certain user [j7], 
[8 1 (see also references in Q). 

In this paper, we consider a MIMO interference channel 
with partial cooperation at the BSs. It is noted that all BSs 
cooperating for transmission to a certain user have to be 
informed about the message (i.e., the bit string) intended for 
the user. This can be realized using the backhaul links among 
the BSs and the central switching unit. We focus on the sum- 
rate maximization (SRM) and on the minimization of weighted 
sum-MSE (WSMSE) under per-BS power constraints and con- 
straints on the number of streams per user. Moreover, although 
non-linear processing techniques such as vector precoding [9|, 
[10 1 may generally be useful, we focus on more practical 
linear processing techniques. Both the SRM and WSMSE 
minimization (WSMMSE) problems are non-convex 11111 . and 
thus suboptimal design strategies of reasonable complexity are 
called for. 

The contributions of this paper are as follows: 

(i) It is first shown in Sec. III-AI that network MIMO with 
partial BS cooperation, that is, with partial message 
knowledge, is equivalent to a MIMO interference channel 
in which each transmitter knows the message of only one 
user under generalized linear constraints, which we refer 
to as MIMO-IFC-GC; 

(ii) We review the available suboptimal techniques that have 
been proposed for the SRM problem lfT2l - lfT4l and extend 
them to the MIMO-IFC-GC scenario where necessary in 
Sec. [V] Since these techniques are generally unable to 
enforce constraints on the number of streams, we also 
review and generalize techniques that are based on the 
idea of interference alignment [5] and are able to impose 
such constraints; 

(iii) Then, we propose two novel suboptimal solutions for the 
WSMMSE problem in Sec.[V]]under arbitrary constraints 
on the number of streams. It is noted that the WSMMSE 
problem without such constraints would be trivial, as it 
would result in zero MMSE and no stream transmitted. 
The proposed solutions are based on a novel insight 
into the single-user MMSE problem with multiple linear 
constraints, which is discussed in Sec. II VI 
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(iv) Finally, extensive numerical simulations are provided 
in Sec. I VIII to compare performance of the proposed 
schemes in realistic cellular systems. 

Linear MMSE precoding and equalization techniques pro- 
posed in this paper were discussed briefly in [15 1. The detailed 
analysis and discussion (including the proofs to the lemmas) 
are included in this paper. Additionally, we have also reviewed 
and extended available solutions to the SRM problem. Fur- 
thermore, we have included discussions of the complexity and 
overhead of the proposed techniques and previously available 
(and/or extended) solutions. 

Notation: We denote the positive definite matrices as A >; 
0. [•] denotes max(-,0). Capital bold letters represent ma- 
trices and small bold letters represent vectors. We denote 
the transpose operator with (-) T and conjugate transpose 
(Hermitian) with (-) H . A~2 represents the inverse square of 
positive definite matrix A. 

II. System Model and Preliminaries 

Consider the MIMO downlink system illustrated in Fig. 
1 with M base stations (BSs) forming a set A4, and K 
users forming a set K,. Each BS is equipped with nt transmit 
antennas and each mobile user employs n r receive antennas. 
The mth BS is provided with the messages of its assigned 
users set JC m C JC. In other words, the fcth user receives 
its message from a subset of Mk BSs A4 k C X. Notice 
that, if JC m contains one user for each transmitter m and 
.Mfc = 1, then the model at hand reduces to a standard 
MIMO interference channel. Moreover, when all transmitters 
cooperate in transmitting to all the users, i.e., JC m = JC for 
all m £ AA or equivalently Mk = M, then we have a MIMO 
broadcast channel (BC). 

We now detail the signal model for the channel at hand, 
which is referred to as MIMO interference channel with partial 
message sharing. Define as u^. = [uk.i ■ ■ -Uk.d k ] T G C dk the 
dk x 1 complex vector representing the dk < mm(Mknt, n r ) 
independent information streams intended for user k. We 
assume that Ufe ~ CAf (0,1). The data streams are known 
to all the BS in the set AAk- In particular, if m £ Mk, the mth 

BSs Users 




Fig. 1. A downlink model with partial BS cooperation or equivalently partial 
message knowledge. 



BS precodes vector via a matrix Bj. m £ C™* *, so that 
the signal x m £ C"' sent by the mth BS can be expressed as 

x TO = Bfc :I „Ufc. (1) 

Imposing a per-BS power constraint, the following constraint 
must be then satisfied 

E[||x m || 2 ] =tr{E[x m x^]} (2) 

= J2 tr i B fe.™ B fe,m} <P m ,m = l,...,M, 
keic m 

where P m is the power constraint of the mth BS. 
The signal received at the fcth user can be written as 

M 

y k = ^ Hfc, m x m + rife (3a) 

m— 1 

= ^ Hfc, m Bfe im Ufe + ^ ^ H; B/ m . 

meM k l^k jeMi 

(3b) 

where Hfe m € C" rX ™ i is the channel matrix between the ?7ith 
BS and fcth user and is additive complex Gaussian noise 
rife CAf(0,lj^. We assume ideal channel state information 
at all nodes. In dJbi i, we have distinguished between the first 
term, which represents useful signal, the second term, which 
accounts for interference, and the noise. 

A. Equivalence with MIMO-IFC-GC 

We now show that the MIMO interference channel with 
partial message sharing and per-BS power constraints de- 
scribed above is equivalent to a specific MIMO interference 
channel with individual message knowledge and generalized 
linear constraints, which we refer to as MIMO-IFC-GC. 

Definition 1: {MIMO-IFC-GC) The MIMO-IFC-GC con- 
sists of K transmitters and K receiver, where the fcth transmit- 
ter has rrit : k antennas and the fcth receiver has m r> & antennas. 
The received signal at the fcth receiver is 

Yk = Hfe, fe Xfe +^Hfe j; x ; +n k , (4) 

l^k 

where rife ~ CAf (0,1), the inputs are Xfe e C m * fc and the 
channel matrix between the Zth transmitter and the fcth receiver 
is Hfe^ e C"v jfc xm t ,fc_ jYie data vector intended for user fc is 
Ufe £ C dk with dk < min(m t .fc, m r .fc) and ~ CAf (0,1). 
The precoding matrix for user fc is defined as Bj. £ C mt - kXdk 
so that Xfe = BfcUfe. The inputs x^ have to satisfy M 
generalized linear constraints 

K K 

^tr{* fe , ro E [x fe x£]} =^tr{# fc , m BfcB£} < P m , 

k=l k=l 

(5) 

for given weight matrices &k,m £ C mtkXmt ' k and m 

1 M. The weight matrices are such that matrices 

^ j &k,m are positive definite for all k = 1, . . . ,K. 

In case the noise is not uncorrelated across the antennas, each user 
can always whiten it as a linear pre-processing step. Therefore, a spatially 
uncorrelated noise can be assumed without loss of generality. 
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We remark that the positive definiteness of matrices 
^ , &k.m guarantees that the system is not allowed to 
transmit infinite power in any direction ifTB") . 

Lemma 1: Let (/)/. be the Zth BS in subset M. k of BSs that 
know user fc's message. The MIMO interference channel with 
partial message sharing (and per-transmitter power constraints) 
is equivalent to a MIMO-IFC-GC. This equivalent MIMO- 
IFC-GC is defined with mt,k — M^rit, rn Ttk = n r , channel 
matrices 



H 



k.l 



Hfe 



(i)/ 



beamforming matrices 



B 



k = 



B I(1) 



' " a k,(M k ) h 



(6) 



(7) 



and weight matrices <&fc >m being all zero except that their Zth 
n t x n t submatrix on the main diagonal is I„ t , if m = (l) k (If 
k ^ JC m then &k,m — 0)- We emphasize that the definition of 
MIMO-IFC-GC and this equivalence rely on the assumption 
of linear processing at the transmitters. 

Proof: : The proof follows by inspection. Notice that 
matrices 2~2m=i ^k.m are positive definite by construction. □ 
Given the generality of the MIMO-IFC-GC, which includes 
the scenario of interest of MIMO interference channel with 
partial message sharing as per the Lemma above, in the 
following we focus on the MIMO-IFC-GC as defined above 
and return to the cellular application in Sec. lVIII It is noted that 
a model that subsumes the MIMO-IFC-GC has been studied 
in Ifl6l , as discussed below. 

B. Linear Receivers and Mean Square Error 

In this paper, we focus on the performance of the MIMO- 
IFC-GC under linear processing at the receivers. Therefore, 
the fcth receiver estimates the intended vector Ufe using the 
receive processing (or equalization) matrix Afe £ C d fc xm >-.fc 
as 

u k =A^y k . (8) 

The most common performance measures, such as weighted 
sum-rate or bit error rate, can be derived from the estimation 
error covariance matrix for each user fc, 



E fe =E 



(u fc - u fe ) (u& - u fe ) 



(9) 



which is referred to as Mean Square Error (MSE)-matrix (see 
ifTTl for a review). The name comes from the fact that that the 
jth term on the main diagonal of Efe is the MSE 

MSE fc j = E[\u k ^~u k J 2 } (10) 

on the estimation of the kth user's jth data stream u k j. Using 
the definition of MIMO-IFC-GC, it is easy to see that the 
MSE-matrix can be written as a function of the equalization 
matrix Afe and all the transmit matrices {^k} k= i as 

Efe =A^HfeBfeB^H^ fc Afe — A^Hfe^Bfe 

- B^H^ fc A fc + A^OfcAfc + Ife. 



(11) 

where tt k is the covariance matrix that accounts for noise and 
interference at user k 



n fc = I + X]H fc ,,B I B z H H|? ii 

l^k 



(12) 



III. Problem Definition and Preliminaries 

In this paper, we consider the optimization of the sum of 
some specific functions f k (Efe) of the MSE-matrices Efe of 
all users k = 1, . . . , K for the MIMO-IFC-GC. Specifically, 
we address the following constrained optimization problem 

K 

minimize J2 A( E fe) 

Bfc,A fc ,Vfe k=l 
K 

subject to J2 tr {*fc. m BfeB£} < P m , m = l,...,M, 

fc=i 

(13) 

where the optimization is over all transmit beamforming 
matrices Bfe and equalization matrices Afe. Specifically, we 
focus on the weighted sum-MSE functions (WSMSE) 

dk 

h (Efe) = tr {WfeEfc} = J2 W W MSE fc.7 ( 14 ) 

i=i 

with given diagonal weight matrices Wfe 6 i£d, k xd k w jj ere 
main diagonal of Wfe is given by [tVk,i, ■■■,W).,d k ] with non- 
negative weights w k j > 0. With cost function ( TBI ), we refer 
to the problem ( fT3l as the weighted sum-MSE minimization 
(WSMMSE) problem. 

Of more direct interest for communications systems is the 
maximization of the sum-rate. This is obtained from (Tl3T > by 
selecting the sum-rate (SR) functions 



jfc(Efe) = log | Efe | 



(15) 



With cost function dl5l ), problem dT3l > is referred to as the sum- 
rate maximization (SRM) problem. In fact, from information- 
theoretic considerations, it can be seen that ([T5T l is the maxi- 
mum achievable rate (in bits per channel use) for the fcth user 
where the signals of the other users are treated as noise (see, 
e.g., 03). 

Remark 1: Consider an iterative algorithm where at each 
iteration a WSMMSE problem is solved with the weight ma- 
trices Wfe assumed to be non-diagonal and selected based on 
the previous MSE-matrix Efe. This algorithm can approximate 
the solution of ( fT3T > for any general cost function f k (E k ). 
This was first pointed out in ITSl for the weighted SRM 
problem in a MIMO BC, then in lfl9ll for the single-antenna 
interference channel and a general utility function, and has 
been generalized to a MIMO (broadcast) interference channel 
in ll20ll with conventional power constraints. It is not difficult to 
see that this result extends also to the MIMO-IFC-GC, which 
is not subsumed in the model of [20| due to the generalized 
linear constraints. We explicitly state this conclusion below. 

Lemma 2 [20]: For strictly concave utility functions f k ( ) 
for all k, the global optimal solution of problem ([T3T l and the 
solution of 

K 

minimize £ {tr {WfeE fc } - tr {Wfe 5fe (Wfe)} 



B fc ,A fc ,W fc ,Vfc ^ 



-A(fffe(Wfe))} 



A' 



subject to J2 tr { * fc , m B fe B£ } < P m , m=l,...,M, 

fc=i 

(16) 

where gk(-) is the inverse function of the V/fe(-), are the same. 
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Consequently, in order to find an approximate solution of 

< f]~3T >. at each step matrices for k = 1 , K are updated 

by solving ( [ToT l with respect to only (i.e., we keep 

(Afe, Bfc), Vfc unchanged in this step). Then, using the obtained 
matrices Wfe, for k = 1, . . . , K, the problem ([Tol l reduces to 
a WSMMSE problem with respect to matrices At and Bfe 
for k = 1,...,K (i.e., matrices Wfe are kept fixed). This 
results in the iterative algorithm, that is discussed in Remark 
1 and that leads to a suboptimal solution of ( fT3l >. In the special 
case of the SRM problem, we have /fc(Efc) = log|Efc| and 
5fc(Wfc) = Wr , in which problem dT6b is then equivalent to 
the problem 

K K 

minimize J2 tr{W fc E fe }- log|W&| 

B fc ,A fc ,W fc ,Vfc k=1 k=1 
K 

subject to J2 tr{* fc . m B fe BH} < P m , m = 1, . . . ,M. 

k=l 

(17) 

The optimization problem (fTTT i can be solved in an iterative 
fashion, where at each iteration the weights are selected as 
W* = E^ 1 and then the WSMMSE problem is solved with 
respect to matrices (A/., Bj.) for k = 1, . . . , K. 



IV. The Single-User Case (K = 1) 

The WSMMSE and SRM problems are non-convex and thus 
global optimization is generally prohibitive. In this section, we 
address the case of a single user (K — 1). In particular, the 
SRM problem with K = 1 is non-convex if one includes 
constraints on the number of streams d\, but is otherwise 
convex and in this special case can be solved efficiently ifTTl . 
The global optimal solution for the single-user problem with 
multiple linear power constraint (and a rank constraint) is still 
unknown [21 ]. The WSMMSE problem is trivial without rank 
constraint, as explained above, and is non-convex. Here we 
first review a key result in ifTTIEl that shows with K = 1 
and a single constraint (M = 1) the solution of the WSMMSE 
problem can be, however, found efficiently. We then discuss 
that with multiple constraints (M > 1), this is not the case, and 
a solution of the WSMMSE problem even with K = 1 must 
be found through some complex global optimization strategies. 
One such technique was recently proposed in [21] based on 
a sophisticated gradient approach. At the end of this section 
we then propose a computationally and conceptually simpler 
solution based on a novel result (Lemma 5), that our numerical 
result have shown to have excellent performance. This will be 
then leveraged in Sec. IVI-BI to propose a novel solution for 
the general multiuser case. 

To elaborate, consider a scenario where the noise-plus- 
interference matrix 17^ (fT2l is fixed and given (i.e., not subject 
to optimization). Now, we solve the WSMMSE problem with 
K = 1 for specified weight matrices W and <& m . For the rest 
of this section, we drop the index k = 1 from all quantities for 
simplicity of notation. We proceed by solving the problem at 
hand, first with respect to A for fixed B, and then with respect 
to B without loss of optimality. The first optimization, over 
A, is easily seen to be a convex problem (without constraints) 
whose solution is given by the minimum MSE equalization 



matrix 



A = (HBB H H H + Jl) ^B. 
Plugging ( [T8l in the MSE matrix (JTTJ. we obtain 



(18) 



E 



I + B H H H f2 X HB 



(19) 



We now need to optimize over B the following problem 
tr |\V (I + B H H H J1 _1 HB) _1 | 



minimize 

B 



subject to tr {* m BB H } < P m , m = 1, . . . , M 



(20) 



Consider first the single-constraint problem, i.e., M = 1. 
The global optimal solution for single-user WSMMSE prob- 
lem with M = 1 is given in [22] [21] and reported below. 
Recall that, according to Definition Q] matrix <&i is positive 
definite. 

Lemma 3 1221: The optimal solution of the WSMMSE 
problem with K = 1 and a single trace constraint (M = 1) is 
given by 

B = $^US, (21) 

where U E c«i*xd j s t jj e ma tnx of eigenvectors of matrix 
& 1 2 Hf2~ 1 H H <I> 1 2 corresponding to its largest eigenvalues 
Ti > • • • > Jd an d S is a diagonal matrix with the diagonal 
terms Vp7 defined as 



Pi 




i + 



(22) 



with /i > being the "waterfilling" level chosen so as to 
satisfy the single power constraint tr {<I>iBB H } = Pi. 

Proof: Introducing the "effective" precoding matrix B = 

1/2 ~ — - 

B and "effective" channel matrix H = 2 , the 

problem is equivalent to the one discussed in 1221 Theorem 
1]. □ 

In the case of multiple constraints the approach used in 
Lemma 3 cannot be leveraged. Here we propose a simple, 
but effective, approach, which is based on the following 
considerations summarized in the following two lemmas. 

Lemma 4: The precoding matrix (l2"H - (l22t for a given fixed 
/i > minimizes the Lagrangian function 

C(B-fi) =tr jw(l + B H *^H H rj~ 1 H*^B 



^itr{BB H } 



(23) 



where B is the effective precoding matrix defined above. 

Proof: We first note that (l23l is the Lagrangian func- 
tion of the single-user single-constraint problem solved in 
Lemma 2. Then, we prove (l23l by contradiction. Assume 
that the minimum of the Lagrangian function is attained at 
where the corresponding E is not diagonal. Then, one can 
always find a unitary matrix Q G c dxd sucn that the matrix 
B* = BQ diagonalizes E since with B* we have E = 

Q H (i + B H *^H H rj- 1 H*^B) Q [22]. The function 
tr {WE} is Schur concave, and therefore the matrix B* does 
not decrease the function tr {WE} with respect to B, while 
BB H = B*B* H . This implies that the minimum of tr{WE} 
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is reached when the MSE matrix is diagonalized. Therefore, 
we can set without loss of generality B = US where U is 
defined as in Lemma 3 and X is diagonal with non-negative 
elements on the main diagonal. Substituting this form of B 
into the Lagrangian function, we obtain a convex problem 
in the diagonal elements of S, whose solution can be easily 
shown to be given by ( 1221 ) for the given /i. This concludes the 
proof. □ 
Lemma 5: Let p* be the optimal value of the 
single-user WSMMSE problem with multiple constraints 
(K = 1, M > 1). We have 



p* > maxinf £(B: A), 

A>0 B 



(24) 



where 



£(B;A) =tr jw(l + B H H H fJHB) *} 

M 

+ ^ A m (tr { * m BB H } - P m ) (25) 



m — 1 



is the Lagrangian function of the single-user WSMMSE prob- 
lem at hand and A = (Ai, . . . , Am)- Moreover, if there exists 
an optimal solution B achieving p* that, together with a strictly 
positive Lagrange multiplier A > 0, satisfies the conditions 



tr 



V B £ - 0, 
{* m BB H } 



= P m , Vm 



(26) 
(27) 



then (l24l holds with equality. 

Proof: The proof is given in the appendix. □ 

Lemma 5 suggests that to solve the single-user multiple- 
constraint problem, under some technical conditions, one can 
minimize instead the dual problem on the right-hand side of 
( 1241 . Lemma 3 showed that this is always possible with a 
single constraint. The conditions in Lemma 5 hold in most 
cases where the power constraints for the optimal solution 
are satisfied with equality. While this may not be always the 
case, in practice, e.g., if the power constraints represent per- 
BS power constraints as in the original formulation of Sec. II, 
this condition can be shown to hold 11231 . 

Inspired by Lemma 5, here we propose an iterative approach 
to the solution of the WSMMSE problem with K = 1 that is 
based on solving the dual problem maxA>o mine £(B; A). 
Specifically, in order to maximize infe £(B; A) over A > 0, 
in the proposed algorithm, the auxiliary variables A is updated 
at the jth iteration via a subgradient update given by lfl6l 

= A^r 1} + S (P m - tr {* m BB H }) , Vm, (28) 

so as to attempt to satisfy the power constraints. Hav- 
ing fixed the vector problem miriB£(B,A) reduces 
to minimizing (|23j with *i = *(A (j) ) = £ m A m*m 
and fi = 1. This can be done using Lemma 3, so that 
from (|2T1i-(|221. at the jth iteration, BW is obtained as 
$(A^) — ^U^SW) where U^' is the matrix of eigenvectors 
of matrix f (A' 3 ')"2HH h $(A'^)"5 corresponding to its 

largest eigenvalues 71 > • • • > 7d and S^j is a dia gonal 

+ 

matrix with the diagonal terms Jpl = 



V. Sum-Rate Maximization 

The SRM problem for a number of users K > 1 is non- 
convex even when removing the constraints on the number 
of streams per user. The general problem in fact remains 
non-convex and is NP-hard [24|. Therefore, since finding the 
global optimal has prohibitive complexity, one needs to resort 
to suboptimal solutions with reasonable complexity. In this 
section, we review several suboptimal solutions to the SRM 
problem that have been proposed in the literature. Since some 
of these techniques were originally proposed for a scenario 
that does not subsume the considered MIMO-IFC-GC, we also 
propose the necessary modifications required for application 
to the MIMO-IFC-GC. Note that these techniques perform an 
optimization over the transmit covariance matrices by relaxing 
the rank constraint due to the number of users per streams 
(see discussion below). Therefore, we also review and modify 
when necessary a different class of algorithms that solve 
problems related to SRM but are able to enforce constraints on 
the number of transmitting streams per user. The WSMMSE 
problem does not seem to have been addressed previously for 
the MIMO-IFC-GC and will be studied in the next section. 



A. Soft Interference Nulling 

A solution to the SRM problem for the MIMO-IFC-GC was 
proposed in lfl2l . In this technique the optimization is over 
all transmit covariance matrices = B^Bfe e C" 1 *''''" 1 '''. 
The constraints on the number of streams would impose a 
rank constraint on as rank(Sfc) = dk- Here, and in 
all the following reviewed techniques below, unless stated 
otherwise, such rank constraints are relaxed by assuming 
that the number of transmitting data streams is equal to the 
transmitting antennas to that user, i.e. dk — m t: k- From (TT~5T > 
and ( fT8l . we can rewrite the (negative) sum-rate as 



K 



E lo si E fei 



K 



fc=l 



fe=l 

+ log|tt fc |, 



log|O fc +H M .S fc H^ ( 



(29) 



where flj. is defined in (fT2l . Notice that it is often conve- 
nient to work with the covariance matrices instead of the 
beamforming matrices B&, since this change of variables may 
render the optimization problem convex as, for instance, when 
minimizing the first term only in (|29l . It can then be seen that 
the SRM problem is, however, non-convex due to the presence 
of the — log I life I term, which is indeed a concave function of 
the matrices 

An approximate solution is then be found in lfl2l via an 
iterative scheme, whereby at each (j + l)th iteration, given 
the previous solution Sir" the non-convex term — log |rX/-| is 
approximated using a first-order Taylor expansion as 



log 1 n k 



0)i 



l^k 



,(.;) 



H 



A:./ 



Sz-S 



(J) 



H 



k.i 



(30) 
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where l~i 
problem 



CO 



E 

l^k 



Since the resulting 



therefore not detailed. The scheme is referred to as "SDP 
relaxation" in the following. We refer to |[T3l for further 
details about the algorithm. 



A" 



minimize _ - log|O fc + H feife S fe H fe!fe | 



S fc ,fe=l,...,K 



fc=l 



subject to tr {* fc , m £ & } < P m , m = 1, . . . , M, 

(31) 

is convex, a solution can be found efficiently. Following the 
original reference [12], we refer to this scheme as "soft 
interference nulling". We refer to lil2l for further details about 
the algorithm. 

B. SDP Relaxation 

A related approach is taken in ifTJl for the SRM problem^] 
for a MIMO-IFC with regular per-transmitter, rather than 
generalized, power constraints. Similarly to the previous tech- 
nique, the optimization is over the transmit covariance matrices 
and under the relaxed rank constraints. In particular, the 
authors first approximate the problem by using the approach 
in lfT8l . Then, an iterative solution is proposed by linearizing 
a non-convex term similar to soft interference nulling as 
reviewed above. It turns out that such linearized problem can 
be solved using semi-definite programming (SDP). Specifi- 
cally, denoting with fil the matrix (fT2l corresponding to 



the solution Bjr ' at the previous iteration j, i.e., SlY 
I+E^fc Ha /H,' 15/ "l^ 1 ,. the SDP problem to be solved 
to find the solutions Bi" 1 for the (j + l)th iteration is given 
by 



CO 



K 



K 



minimize 



subject to 



£tr{Y fc }+ £tr{c fc 
S fc } < P m , 
Hfc ) jfcSj(.H fc k + O 



0> 



k=l 

E tr{* fe 

k=l 



and E fc y 0, k = 1, 



m = 1, . 
Y fe 



,M 



where 



W«=I + H fejfe S^H^r 



COttH o 0')-i 



>=0, 



(32) 



-CO 



ijtk \ I ) 

H^f'H^I + ^HuS^H^j H 4 , fe , (33) 

and Yfe is an auxiliary optimization variable, 
defined using the Schur complement as Yfe = 

W k n ( k 3) (H fe , fe £ fe H£ fe + n[ j) ^ to convert the original 
optimization problem to an SDP problem |fT3l . The derivation 
requires minor modifications with respect to ifPTI and is 



C. Polite Waterfilling 

Reference [16] studied the (weighted) SRM problem for a 
general model that includes the MIMO-IFC-GC. We review 
the approach here for completeness. Two algorithms are pro- 
posed, whose basic idea is to search iteratively for a solution 
of the KKT conditions 1 1 1 ] for the (weighted) SRM problem. 
Notice that, since the problem is non-convex, being a solution 
of the KKT conditions is only necessary (as proved in [16]) but 
not sufficient to guarantee global optimality. It is shown in [ 16 1 
that any solution Efe, k = 1, . . . , K, of the KKT conditions 
must have a specific structure that is referred to as "polite 
waterfilling", which is reviewed below for the SRM problem. 

Lemma 6 i [76l/ : For a given set of Lagrange multipliers A = 
(/iAi, /xAm), where /i > and At > for i = 1,...,M, 
associated with the M power constraints in (fT~3b . define the 
covariance matrices 

M 

n k = ^n$k, m + "''iSJ 1 ^- ( 34 ) 



m— 1 



with 



Sfe — — (fl k 1 — (Ofe + Hfe^fcSfeH^ k ) 



(35) 



An optimal solution Sfe, k — 1,...,K, of the SRM problem 
must have the "polite waterfilling" form 



£ fe = n fe 3 v fe p fe v£n fe ' 



(36) 



where the columns of Vfe are the right singular vectors of 
the "pre- and post- whitened channel matrix" f2 fe 2 Hfe.fef2 A , 2 
with ( fT2b for k = 1, . . . , K, and Pfe is a diagonal matrix with 
diagonal elements pk,i- The powers pk,i must satisfy 

+ 



Pk„ 



(37) 



1 1 

> Ik.;, 

where J^ik,i is the ith singular value of the whitened matrix 

O fc 2 Hfe,fef2 fc 2 . Parameter fi > is selected so as to satisfy 
the constraint 

M 

E 



A, 



K 

■E 

fc=i 



tr{*fc, m Sfe} < V A m P, 



M 

E 

m— 1 



(38) 



-More generally, the reference studies the weighted SRM problem. 



which implied by the constraints of the original problem (fT3l ). 
Moreover, parameters \ > are to be chosen so as to satisfy 
each individual constraint in ( fT3l . 

In order to obtain a solution k = 1, . . . , K, according 
to polite waterfilling form as described in Lemma 6, lTT6l 
proposes to use the interpretation of Ofe in (l34l > as the 
interference plus noise covariance matrix and Sfe in (|35i as 
the transmit covariance matrix both at the "dual" systemr. 

3 In the "dual" system the role of transmitters and receivers is switched, 
i.e., the fcth transmitter in the original system becomes the fcth receiver in the 
"dual" system. The channel matrix between the fcth transmitter and the Zth 
receiver in the dual system is given by H ; H fe . 
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Based on this observation, the algorithm proposed in |fT6l 
works as follows. At each jth iteration, first one calculates 
the covariance matrices X^ in the original system using the 
polite waterfilling solution of Lemma 6; then one calculates 
the matrices using again polite waterfilling in the dual 
system as explained above. Finally, at the end of each jth 
iteration, one updates the Lagrange multipliers as 



Etr{ 

\0+i) = \0) k=1 



Mi 1 



Pn 



(39) 



thus forcing the solution to satisfy the constraints of the SRM 
problem (fl3l l. For details on the algorithm, we refer to |fT6l . 

Remark 2: Other notable algorithms designed to solve the 
SRM problem for the special case of a MIMO-BC with 
generalized constraints are [25 1, [26|. As explained in |[T6ll . 
these schemes are not easily generalized to the scenario at 
hand where the cost function is not convex. As such, they 
will not be further studied here. 



D. Leakage Minimization 

While the techniques discussed above do not enforce con- 
straints on the number of stream per users, here we extend 
a technique previously proposed in ll27l that aims at aligning 
interference through minimizing the interference leakage and 
is able to enforce the desired rank constraints. It is known that 
this approach is solves the SRM problem for high signal-to- 
noise-ratio (SNR). In this algorithm, it is assumed that the 
power budget is divided equally between the data streams 
and the precoding matrix of user k from BS m is given as 
Bfe, m = J j^^B k , m where B kiTn is a n t x d k matrix of 

orthonormal columns (i.e. B^ m Bfc. m = I). The equalization 
matrices are also assumed to have orthonormal columns (i.e. 
A^A/j = I). Hence, there is no inter-stream interference for 
each user. Total interference leakage at user k is given by 



/= Vtr{A£Q fc A fe } 



(40) 



where Q fe = £^ fc T, m eM j ^-H fc , ro B fc , ro B^ TO H^ m . To 
minimize the interference leakage, the equalization matrix Afc 
for user k can be obtained as A k = Vd k (Qk) where Vd k (A) 
represents a matrix with columns given by the eigenvectors 
corresponding to the d k smallest eigenvalues of A. Now, for 
fixed matrices A^, the cost function (l40l can be rewritten as 



I = 



E E 



(41) 



^H^A^H^O Min- 



where Q fc , m = J2j^k,jeK„ 

imizing over the matrices leads to choosing Bk, m = 
Vd k (Qk,m)- The algorithm iterates until convergence. We refer 
to this scheme as "min leakage" in the following. 

4 In the original work [ 27 1 which is proposed for the interference channels, 
the algorithm iteratively exchanges the role of transmitters and receivers to 
update the precoding and equalization matrices similarly. 



E. Max-SINR 

Another algorithm called "max-SINR" has been proposed 
in ll27l which is based on the maximization of SINR, rather 
than directly the sum-rate. This algorithm is also able to 
enforce rank constraints. The max-SINR algorithm assumes 
equal power allocated to the data streams and attempts at 
maximizing the SINR for each stream by selecting the receive 
filters. Then, it exchanges the role of transmitter and receiver 
to obtain the transmit precoding matrices which maximizes the 
max-SINR. This iterates until convergence. A modification of 
this algorithm is given in ll28l by maximizing the ratio of 
the average signal power to the interference plus noise power 
(SINR-like) term. However, these techniques are only given 
for standard MIMO interference channels and not for MIMO- 
IFC-GC. 

VI. MSE Minimization 

In this section, we propose two suboptimal techniques 
to solve the WSMMSE problem. We recall that with the 
WSMMSE problem enforcing the constraint on dk is necessary 
in order to avoid trivial solutions. Performance comparison 
among all the considered schemes will be provided in Sec. I VIII 
for a multi-cell system with network MIMO. 

A. MMSE Interference Alignment 

A technique referred to as MMSE interference alignment 
(MMSE-IA) was presented in 1 19 1 for an interference channel 
with per-transmitter power constraints and where each receiver 
is endowed with a single antenna. Here we extend the approach 
to to the MIMO-IFC-GC. 

The idea is to approximate the solution of the WSMMSE 
problem by optimizing the precoding matrices B k 
followed by the equalization matrices A k and iterating 
the procedure. Specifically, initialize B^ arbitrarily. Then, at 
each iteration j: (i) For each user k, evaluate the equalization 

L k 



matrices using the MMSE solution ( fT8l ), obtaining At = 



H fefe B 



(J-1) B (J'-1)H 



k 



H 



k.k ' 



ft 



(i-i) 



Hfc,feB 



(i-i) 



where 



'I 



'I 



from tO we have it? 
(ii) Given the matrices A); , the WSMMSE problem becomes 

K 



minimize 

B fe , k=i,...,K k 



£tr{w fc E«} 



K 



subject to J2 tr {*fc,mB fe B£} < P m ,Vm e M 

k=l 



(42) 



where E[ is (fTTT i with Aj, in place of Afe. Fixing the 
equalization matrices A^\\/fc, this problem is convex in Bfe 

and can be solved by enforcing the KKT conditions. Therefore, 

(i) 

matrices B fe for the jth iteration can be obtained as follows. 
Lemma 7: For given equalization matrices A\?\ a solution 

0) u- 



B 



1, K, of the WSMMSE problem is given by 



K 



(43) 



s 



where ji m are Lagrangian multipliers satisfying 




Mro > (44) 
P m ) = (45) 



^ m B«B« H }<^ 



and the power constraints "%2k=i tr 
for all m. 

Once obtained the matrices using the results in Lemma 
7, the iterative procedure continues with the (j + l)th iteration. 
We refer to this scheme as extended MMSE-IA, or eMMSE- 
IA. 

Remark 3: The algorithm proposed above reduces to the 
one introduced in [19| in the special case of per- transmitter 
power constraints and single-antenna receivers. It is noted that 
in such case, problem (l42l can be solved in a distributed fash- 
ion, so that each transmitter k can calculate its matrix (more 
precisely vector, given the single antenna at the receivers) 
independently from the other transmitters. In the MIMO-IFC- 
GC, the power constraints couple the solutions of the different 
users and thus make a distributed approach infeasible. 

B. Diagonalized MMSE 

We now propose an iterative optimization strategy inspired 
by the single-user algorithm that we put forth in Sec. [IV] 
At the (j + l)th iteration, given the matrices obtained at 
the previous iteration, we proceed as follows. The weighted 
sum-MSE (TT4l with the definition of MSE-matrices ( fTTT > is a 
convex function in each A& and Bfe when (Bj, Aj),Vj ^ k 
are fixed. Nevertheless, it is not jointly convex in terms of 
both (Afc,Bfc). Inspired by Lemma 5 for the correspond- 
ing single-user problem, we propose a (suboptimal) solution 
based on the solution of the dual problem for calculation of 
(Afc,Bfc). To this end, we first obtain Afe as (fT~8T >. Then, 
we simplify the Lagrangian function with respect to Bfe 
by removing the terms independent of Bfe. Specifically, by 
defining Tfe = J2i^k H ; H feA;W;A ; H H;.fe, we have that the 
Lagrangian function at hand is given by 

£(Bfe;A) =tr{Wfe (I 

+ tr{TfeBfe.B^} 



B fc H fe,fe^fc ^fe^Bfe) 



tr 



BfeB 



(46) 



This Lagrangian function for user k is the same as the 
Lagrangian function (l25T l of single-user WSMMSE problem 
when *(A) is replaced with F fc (A) = T fc + ^A m *fc, m . 
Matrix Ffc(A) is non-singular and therefore, using the same 
argument as in the proof of Lemma 5, for a given Lagrange 
multipliers A and given other users' transmission strategies 
(A;,B;),Vi k, the optimal transmit precoding matrix can 
be obtained as 



Bfe =Ffc(A)-5Ufe£* 



(47) 



where Ufe 6 £,mt,k><d k j s t jj e eigenvectors of 
Ffe(A) _ 2H fc H fc r2,: 1 Hfe i feFfe(A) _ 2 corresponding to its 



largest eigenvalues 7^1 > 
matrices with the elements 



. . > 7fe A and S fc 
/pkj. given by 



Pk,- 



Ik,, 



1 

Ik,- 



is diagonal 



(48) 



with A >z being the Lagrangian multipliers satisfy the 
power constraints. Since this scheme diagonalizes the MSE 
matrices defined in (O, it is referred to as diagonalized MMSE 
(DMMSE). 

To summarize, the proposed algorithm at each iteration j 
(0 evaluates the transmit precoding matrices Br ' given other 
users' transmission strategies (Ap , Bp ) using fflD - 
(l48l i (//) updates the equalization matrices using the MMSE 
solution (fT8l : (Hi) updates the A via a subgradient update 



A" 



A^ +1) = A« + 5 P m - £tr {#fe, m Bfe.BH} 



(49) 



fe=i 



to satisfy the power constraints. 

Remark 4: In this paper, we assume perfect knowledge of 
channel state information (CSI). Therefore, each transmit- 
ter and receiver has sufficient information to calculate the 
resulting precoders and equalizers by running the proposed 
algorithms. Under this assumption, which is common to other 
reviewed works such as [12] [13], no exchange of precoder 
and equalizer vectors is required between the transmitters 
and receivers. Nevertheless, in practice, the CSI may only 
be available locally, in the sense that transmitter k knows 
channel matrices H/,fe, for all I = 1, . . . , K, whereas receiver 
k is aware of channel matrices Hfe j, for all I = 1,...,K. 
The proposed DMMSE and the reviewed PWF [[UllESD al- 
gorithms require, beside the local CSI, that the transmitter 
k has available also the interference plus noise covariance 
matrix, fife, and the current equalization matrices A/ for all 
I = 1, . . . , K in order to update the precoder for user k. Hence, 
to enable DMMSE and PWF with local CSI, exchange of the 
equalizer matrices is needed between the nodes. Similarly, 
the proposed eMMSEIA, and min leakage and Max-SINR 
algorithms [27], require the transmitters to know the equalizing 
matrices A; for I = 1, . . . , K at each iteration, in addition to 
the local CSI. Moreover, each receiver must know the current 
precoders B; for all I = 1, . . . , K . Therefore, the overhead for 
the proposed eMMSEIA and the min leakage and Max-SINR 
algorithms involves the exchange of equalizer and precoder 
matrices between the transmitters and receivers. However, 
these latter algorithms can also be adapted using the bi- 
directional optimization process proposed in [29]. This process 
involves bi-directional training followed by data transmission. 
In the forward direction, the training sequences are sent using 
the current precoders. Then, at the user receivers the equalizers 
are updated to minimize the least square error cost function. 
In the backward training phase, the current equalizers are used 
to send the training sequences and the precoders are updated 
accordingly. Finally, the SIN |H2 and SDP relaxation [B] 
techniques are applied in a centralized fashion (rather than by 
updating the transmitter and receiver for each user at each 
iteration), and they require centralized full knowledge of all 
channel matrices. 
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Remark 5: Reference [13] addresses the SRM problem for 
a MIMO-IFC with regular per- transmitter, rather than general- 
ized, power constraints. The problem is addressed by solving 
an SDP problem at each iteration. Moreover, the optimization 
is over the transmit covariance matrices and under the relaxed 
rank constraint. This enforces a constraint on the number 
of transmitted streams per user. References [H4l - |fl6l study 
the (weighted) SRM problem by decomposing the multiuser 
problem into single-user problems for each user. Each single- 
user problem is a standard single-user SRM problem with an 
additional interference power constraint. The approach used 
in lfT4l - lfT6l assumes that the number of transmitted streams 
is equal to n r . In this paper, we address WSMMSE problem 
and allow for an arbitrary number of streams (dk < n r ). 

Remark 6: Our algorithms consists of an inner loop, which 
solves the WSMMSE problem, and an outer loop, which is the 
subgradient algorithm to update A. The subgradient algorithm 
in the outer loop is convergent (with a proper selection of 
the step sizes [30]) due to the fact that the dual function 
infB£(B;A) is a concave function with respect to A ifTTI . 
The inner loops of the proposed algorithms in this paper (i.e. 
eMMSEIA and DMMSE) are convergent since the objective 
function decreases at each iteration. A discussion of the 
convergence for a special case of the eMMSEIA algorithm can 
be found in lfl9l . However, the original problem is non-convex 
and our solutions are only local minima. Nevertheless, the 
DMMSE algorithm is shown to converge to a local minimum 
with better performance compared to the previously known 
schemes in Sec. IVHI 

VII. Numerical Results 

We consider a hexagonal cellular system where each BS 
is equipped with n t transmit antennas and each user has n r 
receive antennas. The users are located uniformly at random. 
Two tiers of surrounding cells are considered as interference 
for each cluster. We consider the worst-case scenario for the 
inter-cluster interference, which will be the condition that 
interfering BSs transmit at the full allowed power |0, ||8], 
OTl . [ 32 1 . We define the cooperation factor k as a number 
of BSs cooperating on transmission to each user. The k BSs 
are assigned to each user so that the corresponding channel 
norms (or, alternatively, the corresponding received SNRs) are 
the largest. 

The propagation channel between each BS's transmit anten- 
nas and mobile user's receive antenna is characterized by path 
loss, shadowing and Rayleigh fading. The path loss component 
is proportional to dZfi, where dk m denotes distance from base 
station m to mobile user fc and (3 = 3.8 is the path loss 
exponent. The channel from the transmit antenna t of the base 
station b at the receive antenna r of the fcth user is given by 


H£? } = 4^70^ (e&) (^y" (50) 

where <xu'b ~ CA/"(0, 1) represents Rayleigh fading, p^ m ^ 
is the lognormal shadow fading between &th BS and fcth user 
with standard deviation of 8 dB, and do = 1 km is the cell 
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Fig. 2. Per-cell sum-rate for a MIMO-IFC-GC with M = 3 and k = 2. 

radius. 70 is the interference-free SNR at the cell boundary. We 
consider one user randomly located per cell for the numerical 
results. 

When sectorization is employed, the transmit antennas are 
equally divided among the sectors of a cell. Each transmit 
antenna has a parabolic beam pattern as a function of the 
direction of the user from the broadside direction of the 
antenna (For more details refer to [0, ll33l ). The antenna gain 
is a function of the direction of the user fc from the broadside 
direction of the ith transmit antenna of the 6th base station 
denoted by ® kb € [— ir, 7r]; 03dB is the half -power angle and 
A s is the sidelobe gain. The antenna gain is given as 11331 

For the 3,6-sector cells A s = 20, 23 dB and e 3dB = ffg, 
respectively Q, 11331 . 11341 . When there is no sectorization we 
set A = l. 

We first compare different algorithms (for the solution of 
the SRM problem) without enforcing rank constraints on SIN, 
PWF, SDP relaxation and setting dk — mm(m tt k, mr,k) = n r 
for the eMMSEIA and DMMSE algorithms. To solve the SRM 
problem, the weight matrices in the eMMSEIA and DMMSE 
algorithms are updated at each iteration as = E^ 1 using 
the current MSE-matrix Efe. Fig.|2]compares the per-cell sum- 
rate of the algorithms discussed in this paper for a cluster 
with M = 3 cells and a cooperation factor k = 2. The 
results show that our proposed DMMSE algorithm outper- 
forms other techniques, while the polite water-filling algorithm 
(PWF) lfT4l . Ifl6l has a similar performance. Our proposed 
eMMSEIA scheme converges to a poorer local optimum value 
compared to these two schemes. The soft interference nulling 
(SIN) 02) and SDP relaxation (HI algorithms, which use 
the approximation of the non-convex terms in the objective 
function, perform worse in this example. 

In Fig. [3] we evaluate the effect of partial cooperation for 
the DMMSE, eMMSEIA, and PWF algorithms in a cluster of 
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Fig. 3. Per-cell sum-rate for a MIMO-IFC-GC with M = 5 and n 
1, 2, 3, 5, nt = 4, n r = d^ = 2, and 2 users per cell. 




Fig. 4. Per-cell sum rate of the schemes that can support dk < 

min(m t jj, m r j.) for d^ = 1, nt = 4, n r = 2, M = 3 and k = 2. 



size M = 5 where each BS is equipped with n t = 4 transmit 
antennas, each user employs n r = 2 receive antennas, and 
2 users are dropped randomly in each cell. Recall that the 
cooperation factor k represents the number of BSs cooperating 
in transmission to each user. It can be seen that as k increases 
the performance improves with diminishing returns as k grows 
large. Moreover, the relative performance of the algorithms 
confirms the considerations above. 

In Fig. [4] we compare again the performance of the schemes 
considered in Fig. [3] but with a stricter requirement on the 
number of streams, namely d^ = 1. It can be seen that the 
proposed DMMSE tends to perform better than PWF, which 
was not designed to handle rank constraints. We have adopted 
the PWF algorithm to support dk < mm(mt,k, m r,k) by using 



a thin SVD of ft. 



when computing ([36}. 



In Fig. [5] we vary the size of the cluster M, showing 
also the advantages of coordinating transmission over larger 
clusters, even when the number of cooperating BSs k is fixed. 
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Fig. 5. Per-cell sum-rate of the proposed DMMSE scheme for cluster sizes 
M = 1, 3, 7 versus the cooperation factor, k, with nt = n r = 2, SNR=20 
dB, and single-user per cell. 
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Fig. 6. CDF of the per-cell sum rates achieved by DMMSE for 5 = 1, 3, 6 
sectors per cell, M = 1,3,7 coordinated clusters, and k = 1, 2, 3 cooperation 
factors with 70 = 20 dB, nt = 6, and n r = 2. The circles represent the 
mean values of the per-cell sum-rates. 



Recall that M represents the set of BSs whose transmission 
is coordinated, but only n BSs cooperate for transmission to 
a given user. As an example, for a cluster size of M = 7 
a cooperation factor of k — 4 performs almost as well as 
the full cooperation scenario with k = 7. Moreover, the 
performance gains with respect to the non-cooperative case 
k = 1 are evident. We also show the performance with a 
cluster containing a single cell, i.e., M = 1. This highlights 
the performance gains attained even in the absence of message 
sharing among the BSs (i.e., k = 1) due to the coordination 
of the BSs within the cluster. 

Finally, the effect of sectorization is studied in Fig. [6] where 
n t = 6 transmit antennas at each BS are divided equally into 
5 = 1,3,6 sectors. Each cell contains 6 users, each equipped 
with n r — 2 receive antennas. The users are randomly located 
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at the distance of |do from its BS. For a given channel 
realization the DMMSE algorithm is used to obtain the per- 
cell sum rate. The cumulative distribution functions (CDFs) of 
per-cell sum rates are computed using large number of channel 
realizations. The gains of sectorization and cooperation are 
compared. For example, the system with coordination of 7 
cells and n = 3 cooperation factor and without sectorization 
performs better than the sectorized system with S — 6 and 
without any coordination between the BSs. 

VIII. Conclusions 

In this paper, we have studied a MIMO interference channel 
with partial cooperation at the BSs and per-BS power con- 
straints. We have shown that the channel at hand is equivalent 
to a MIMO interference channel under generalized linear 
constraints (MIMO-IFC-GC). Focusing on linear transmission 
strategies, we have reviewed some of the available techniques 
for the maximization of the sum-rate and extended them 
to the MIMO-IFC-GC when necessary. Moreover, we have 
proposed two novel strategies for minimization of the weighted 
mean square error on the data estimates. Specifically, we 
have proposed an extension of the recently introduced MMSE 
interference alignment strategy and a novel strategy termed 
diagonalized MSE-matrix (DMMSE). Our proposed strategies 
support transmission of any arbitrary number of data streams 
per user. Extensive numerical results show that the DMMSE 
outperforms most previously proposed techniques and per- 
forms just as well as the best known strategy. Moreover, our 
results bring insight into the advantages of partial cooperation 
and sectorization and the impact of the size of the cooperating 
cluster of BSs and sectorization. 

We conclude with a brief discussion on the complexity of 
the algorithms. Due to the difficulty of complete complexity 
analysis, especially in terms of speed of convergence, we 
present a discussion based on our simulation experiments. 
The PWF algorithm converges in almost the same number 
of iterations as the DMMSE algorithm. The complexity per 
iteration of PWF and DMMSE is also almost the same as 
K (0(Kn t n^) + 0(nf)) (required for the thin SVD opera- 
tion). However, the PWF algorithm contains additional op- 
erations (matrix inversion and SVD) to obtain the precoding 
matrices from the calculated transmit co variance matrices |f| 
Also, the PWF algorithm includes a water-filling algorithm 
within its inner loop, which is not required in the DMMSE 
algorithm. The eMMSEIA algorithm has lower complexity 
per iteration (i.e. KO(nf)) than the PWF and DMMSE al- 
gorithms, since its complexity is due to a matrix inversion per 
iteration per user. However, eMMSEIA converges in a larger 
number of iterations than DMMSE and PWF. The complexity 
per iteration for the SDP relaxation is higher than for the SIN 
algorithm (this is because of the extra auxiliary positive semi- 
definite matrix variable, Y, introduced in the SDP relaxation 
algorithm). The SIN algorithm also converges in a smaller 
number of iterations than the SDP relaxation algorithm. 

5 This can be performed together with finding the MMSE receive matrices. 



Appendix 
Proof of Lemma 5 

The inequality (l24l > follows from weak Lagrangian duality. 
We now prove the second part of the statement. Recognizing 
now that tr {WE} with ([T9T i is a Schur-concave function 
of the diagonal elements of (fT9]R it can be argued that the 
minimum is attained when E is diagonalized as we did for 
Lemma 4. Defining R = H H f2 _1 H, we can conclude that 
B H RB must be also diagonal in this search domain. Now 
assume that an optimal solution of the single-user WSMMSE 
problem is denoted as B. Without loss of generality we can 
assume that this solution diagonalizes the MSE matrices. The 
necessity of the KKT conditions can be proved as in 1 16] and 
in special cases such as the MIMO interference channel with 
partial message sharing of Sec. II, it also follows from linear 
independence constraint qualification conditions [ 30 1 . 

Hence, there exists a Lagrange multiplier vector A which 
together with B satisfies the KKT conditions of the WSMMSE 
problem d20b [18|[30|. As it is stated in the Lemma, we 
consider the case that A m are also strictly positive (i.e. A m > 
for all to). Simplifying the KKT condition f26l) . we hav^| 

M 

V B £ = RBEWE + Am*mB = (52) 

m— 1 

Left-multiplying ( 1521 by B H gives us 

B H RBEWE = B H A m *mJ B. (53) 

Since B H RB and correspondingly E are diagonal matrices, 
B H ^X) m ^ m ^ m ) B must also be diagonal. For simplicity, 

we introduce «&(A) = J^m=i ^m*m- Since A m > for 
every to, therefore $(A) is a non-singular matrix. This can 
be easily verified due to the structure of <!>,„. Hence, we can 
write B H *&(A)B = A where A e £, dxd j s a diagonal matrix. 
Therefore, we can write 

*(A) 1/2 B = US (54) 

where U G $yn t xd cons j s t s f orthonormal columns (i.e. 
U H U) and S G C dxd is a diagonal matrix with the diagonal 
terms of \fy[. Hence, we can write 

B = *(A)- 1/2 U£. (55) 

Replacing the structure of B given in ([55b . we can write 

B H RB = £ H U H *(A) 3R*(A) 5U£ = D (56) 

Thus, we can conclude from the equation above that U must 
contain the eigenvectors of $(A)"5R$(A) _ 2. 

6 A Schur-concave function /(x) of vector x = [x\, ...,Xd) is such that 
/(x) < /(x') if x majorizes x', that is, if 5^ =1 x [i] ^ 5Zi=i x [i] f° r a " 
j = 1, d, where xy^ (and £m) represents the vector sorted in decreasing 
order, i.e., xpj > ... > x^ (and xL, > ... > x'r^). 

7 We use differentiation rule Vxtr{AX H B} = BA and 
Vx tr {Y -1 } = — Y — 1 (VxY) Y — 1 , For the complex gradient operator 
each matrix and its conjugate transpose are treated as independent variables 

m 
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Now, plugging d55l into (l26l and left-multiply it with 3> 2 , 
we get 



rs i + rs- 



w 1 + rs 1 



= s 



(57) 



where f(A) = diag[7i(A) • • • 7d(A)] is a diagonal matrix 
with the diagonal terms of the d largest eigenvalues of 
$(A) _ 5R$(A)~2. Since all the matrices are diagonal, (l57l 
reduces to the scalar equations: 



= 1 



Solving these equations gives us the optimal pi given by 



(58) 



Pi 



W; 



1 



7i(A) 7i(A) 



(59) 



Thus, for the given Lagrange multiplier A which together with 
B, satisfying the KKT conditions of f20l) , B must satisfy ( T55T > 
and d59l . If all power constraints are satisfied with equality 
by this solution, then d55T > and ( 1591 also solves the single 
constraint problem 



minimize tr 

B 



(60) 



•|W(I + B H H H «- 1 HB) 

f 1 M _ 

subject to tr<^*(A)BB H ^< £ A m P m , 

J m=l 

The solution of this problem is given in Lemma 3 as 

B(A) = #(A)^UE (61) 

where U consists of d eigenvectors of <1>(A)~3R«I?(A) - ^ 
corresponding to its largest eigenvalues and S is a diagonal 
matrix with the diagonal elements of Jpl, which is given by 



Pk,: 



Wi 



M7( A ) 7(A) 



(62) 



for a waterfilling value of fi > which satisfies the power 
constraint 

tr {*(A)B(A)B(A) H } < £ X m P m . (63) 

rn 

On the other hand, summing up the KKT conditions 
A m (P m — tr {$ m BB H }) = for all m, we obtain that 



tr • 



^A m #Jj BB H | 



(64) 



If we set jit = 1 and comparing ( 1591 and ( 1621 . we can conclude 
that pi — pi,Vi which together with comparison of (|6T1 
and (BBl we can conclude that B(A) = B and the /i = 1 
is the optimal Lagrange multiplier of the single-constraint 
WSMMSE problem d60l . Following Lemma 4, this precoding 
matrix is also a result of minimization of the Lagrangian 
function (|23l when /1 = 1 and = 3>(A), which means 



p* = inf£(B;A). 

B 



On the other hand, we have 



maxinf £(B; A) > inf £(B; A) 

A>0 B B 



(65) 



(66) 



which in concert with d24l and 
thus concluding the proof. 



results in 



inf £(B; A) = maxinf £(B; A), 

B A>0 B 



(67) 
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